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Abstract 

A recent result of Eden, Levi, and Ron (ECCC 2015) provides a sublinear time algorithm to 
estimate the number of triangles in a graph. Given an undirected graph G, one can query the 
degree of a vertex, the existence of an edge between vertices, and the *th neighbor of a vertex. 
Suppose the graph has n vertices, m edges, and t triangles. In this model, Eden et al provided a 
0(poly(e“^ logn)(n/n/^ time algorithm to get a (H-£:)-multiplicative approximation 

for t, the triangle count. This paper provides a simpler algorithm with the same running time 
(up to differences in the poly(e“^ logn) factor) that has a substantially simpler analysis. 


1 Introduction 

Counting the number of triangles in a graph is a fundamental algorithmic problem. It has been 
studied widely in theory and well as practice. With the recent study of complex networks and 
massive real-world graphs, triangle counting is a key operation in graph analysis for bioinformatics, 
social networks, community analysis, and graph modeling [HL70, C0I88, EM02, Por98, MSI^02, 
Bur04, BBCG08, WDCIO, BHLPll, SKP12]. 

There has been much study on triangle counting by the theoretical computer science community, 
where the primary hammer is fast matrix multiplication [IR78, AYZ97, BPWZ14]. On the more 
applied side, there is a plethora of provable and practical algorithms that employ clever sampling 
methods for approximate triangle counting [CN85, SW05b, SW05a, Tso08, TKMF09, KMPTIO, 
AvrlO, ecu, SVll, TKMll, AKM12, SPK13, TPT13]. Triangle counting has also been a popular 
problem for the streaming setting [BYKS02, JG05, BFL'''06, AGM12, KMSS12, JSP13, PTTW13, 
TPT13, ADNK14]. 

Yet all these algorithms always read the entire graph. A recent result of Fden, Levi, and 
Ron [FLR15] provided the first truly sublinear algorithm. For a graph with n vertices, m edges, 
and t triangles, their running time is 0(poly(e“^ logn)(n/U/^ -|- m^/^/t)). Observe that when 
t = (for constant (5 > 0), this algorithm does not even read the entire graph. They also 

prove a lower bound, tight up to the poly(e“^ log n) factor. The algorithm and analysis are quite 
intricate, and require a number of ideas. This paper provides a simpler algorithm with a simpler 
analysis. 

Let us explain the formal result. The input is an undirected graph G = {V,E), stored as an 
adjacency list. We set V = [n], where n is known to the algorithm. The following queries are 
allowed: 

• Degree queries: for any v £ V, we can get dy, the degree of v. 

• Neighbor queries: for any v G V and i < dy, we can get the ith neighbor of v. 

• Fdge queries: for any u,v £ V, we can check if (u, v) £ E. 
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We denote the number of edges by m and the number of triangles by t. Our main result is the 
following (identical to that of [ELR15]). 

Theorem 1. There exists an algorithm with the following guarantee. Given query access to graph 
G and an approximation parameter e > 0, the algorithm outputs an estimate for t in the range 
[(1 — e)f, (1 + e)t] with probability at least > 2/3. The expected running time of the algorithm is 
0 (poly(e“^ logn)(n/t^/^ + /t)). 

We can easily boost the success probability by repeating and taking the median estimate. The 
query complexity can be made 0(poly(e“^ log n) min(m, As we shall see, the 

algorithm begins by quickly obtaining an accurate estimate of m. If the algorithm ever makes more 
queries than m, we can pause, query the entire graph and store it. Now, we run the remainder of 
the algorithm on this stored version. 

1.1 Previous work 

For the sake of clarity, we suppress any dependences on the approximation parameter e or on log n 
using the notation 0*{-). The result of Eden, Levi, and Ron [ELR15] (denoted ELR) can be traced 
back to ideas of Feige [FeiOG] and Goldreich and Ron [GR08] , who provided sublinear algorithms for 
estimating the average degree. Feige shows that averaging the degree of 0*{nfy/m) uniform random 
vertices suffices to give a (2 + e)-approximation of the average degree. His primary tool was a new 
tail bound for sums of independent random variables with unbounded variance. Goldreich and Ron 
subsequently give (Feige also observes this in the journal version) a substantially simpler proof, and 
the cost of a worse dependence on e. Extending the analysis, they provide a (1 + e)-approximation 
with the same query complexity when the algorithm is also allowed neighbor queries. Gonen, Ron, 
and Shavitt [GRSll] build on these ideas to count stars, and ELR goes further to count triangles. 

We note that there is a signihcant leap required to perform triangle counting. Since the query 
model allows the algorithm to directly get degrees, it is plausible that one can estimate moments 
of the degree sequence. Triangle counting is a much harder beast, since mere knowledge of degrees 
provides no (obvious) help. 

The high level approach in ELR, building on Goldreich-Ron and Gonen et al, is vertex bucketing. 
We bin the vertices in O(logn) buckets, by rounding their triangle count (or degree, in previous 
work) down to the nearest power of (1 + e). With respect to the buckets, each edge or triangle 
has only poly(logn) different possibilities, or “types”. So we can apply concentration bounds for 
each type of edge/triangle, take a union bound over the different types. Of course, the challenge is 
now in counting triangles of a given type. There are numerous issues caused by small buckets and 
determining the type of a triangle, which lead to a fairly complex algorithm and analysis. 

1.2 Main ideas 

Our approach goes via a different route. Start with the exact triangle counter of Ghiba and 
Nishizeki [GN85]. Simply enumerate over all edges, and count the number of triangles te containing 
each edge e. For edge e = (u, u), it suffices to compute the intersection of neighbor lists of u and 
V. This can be done in m.m{du, dy) time, leading to an overall bound of X^e=(u w) 
bounds on the arboricity (or degeneracy) of graphs, this expression can be bounded by 0(m^/^). 
How about implementing this algorithm using sampling? 

Pick a u.a.r. (uniform at random) edge e and let u denote the endpoint of lower degree. Pick 
a u.a.r. neighbor w of u. If e and w form a triangle, set X = dy Else, X = 0. The expected 
value of X is precisely 3t/m, so one hopes that 0{m/t) samples suffices to get concentration. But 
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the variance of X could be too large, so we simply use more samples for “high variance edges”. 
After picking e with endpoint u, if du is large, we sample more neighbors wi,W 2 , - ■ ■ and take the 
average of the corresponding X values. We set the number of these neighbors to be dujy/m, and 
can show that var(Ai) < y^E[X]. So jt') samples suffice for concentration. But the time 

per sample has potentially gone up from constant to expected m~^ Yle=(uv) d„)/-vA^- By 

Chiba-Nishizeki’s bound, this is still 0(1) and we are done! 

Our problem is not yet solved, because we need to sample a uniform random edge, a query that 
is not allowed. We can sample some set S of vertices, query all their degrees, and can sample a u.a.r. 
edge incident to S in 0(1) time. We could perform the above algorithm on edges incident to S, but 
we required the number of triangles incident to S to concentrate well. This is suspiciously similar to 
the average degree questions of Feige and Goldreich-Ron. Using a stripped-down version of Feige’s 
analysis, we can argue that sampling 0(n/t^/^) vertices is enough, with a painful caveat. This 
only provides a 3-approximation to t (analogous to Feige’s 2-approximation for average degree). 
To get down to (1 -|- e), we need to sample a uniform triangle incident to S, determine the number 
triangles incident to these vertices, and then weight the triangle accordingly. (ELR also perform 
the sampling of vertices and explicitly move from the 3 to (1 + e)-approximation. In this 

analysis, we go straight to the (l + e)-approximation.) The number of triangles incident to a vertex 
V can be estimated by the algorithm in the previous paragraph, by sampling edges incident to v. 
With some care, all of this can be put together in + m jt) time. 

As an aside, we also give a simpler analysis of a (1 -|-e)-approximation 0*{n/y/m) algorithm for 
the average degree, first shown by Goldreich-Ron [GR08]. We defer this to the end of the paper. 

2 Preliminaries 

We use dy for the degree of vertex v, 6{v) for the set of edges incident to v, and F(n) for the 
neighborhood of v. We use tg for the number of triangles incident to edge e, and set = Yle&5(v) 
Note that the latter is twice the number of triangles incident to v. The set of triangles incident to 
e is Te- We use c, ci,... to denote sufficiently large constants. 

Our initial description of the algorithm will use the value of m and t to decide how much to 
sample. This (somewhat circular) assumption is easily removed by doing a geometric search on m 
and t, and is explained at the end of our proof. We use e to denote the approximation parameter. 

3 Heavy and light vertices 

Roughly speaking, a heavy vertex is one with either high degree or many triangles. By “high”, we 
mean in the top values. For the main algorithm, it will be important to distinguish such heavy 
vertices efficiently. 


3 


heavy(?;) 

1. If (i„ > 2m/output heavy. 

2. Repeat for i = 1,2,..., clog n: 

(a) Repeat for j = 1, 2,..., (4/e^)(m^/^/t) = s: 

i. Select u.a.r. edge e € 6{v), and let u be endpoint with smaller degree. 

ii. Repeat for A: = 1, 2,... , : 

A. Pick u.a.r. neighbor w of u. 

B. If e with w forms a triangle, set else = 0. 

hi. Set Yj = Y^k^k/\du/Vm]. 

(b) Set Xi = dv ^'/s- 

3. If median of XiS is greater than output heavy, else output light. 

We have three nested loops, with loop variables i,j,k respectively. We reference these as 
“iteration z”, “iteration j”, and ’’iteration /c”. 

Lemma 2. For any iteration i, Pr[|W — t^\ > e max{t^, tdy/m)] < 1/4. 

Proof. Fix an iteration j, and let ej denote the edge chosen in the jth iteration, with uj as the 
smaller degree endpoint. We use Sj to denote the event of ej being chosen. The probability 
of hnding a triangle in any iteration k is tejduj- Hence, E[Zfc|Tj] = {tf,Jduj) ■ dy. = t^,, and 
var(Zfc|Tj) < E[Z||Tj] < dujFi[Zk\£j\. By linearity of expectation, E[l^|£’j] = te and by indepen¬ 
dence, var(Y^|Tj) < E[l^]/|'d„./-y/m] < y^E[Y,|Tj]. The conditioning can be removed to yield 

EK'] = Yje£5(v) ^ejdy = tyjdy and var(yj) < y/mE[Yj]. 

By Chebyshev’s inequality on E = Pr[|y — t^/d^| > s max{ty / dy, t / m)] is at most 


var(y) 


< 


y/fn{tyldy) 


Taayi{ty/dy,t/mY e^(A:ljt) ■ ityjdy) ■ {tim) 


= 1/4 


□ 


Lemma 3. The following hold with probability > 1 — 1/n over all calls o/ heavy. If v is declared 
light, then ty < 2t^/^/e^/^. If v is declared heavy, then dy > 2m/{et)^/^ or ty > t^/^/2e^/^. 

Proof. It is more convenient to prove the contrapositive statements. Obviously, if dy > 2m/(et)^/^, 
then V is heavy. So assume that dy < 2m/(et)^/^. Then, tdy/m < 2t^/^/e^/^. Suppose ty > 
2t^/^/e^/^. By Lemma2, for any iteration i, Pr[|Aij — ty\ > ety] < 1/4. By a standard Chernoff 
bound, the median of the clogn XjS will be greater than with probability at least 1 — 1 /n^, 

and V is declared heavy. 

Suppose ty < t^/^/2e^/^. By Lemma2, Pr[|Aij — ty\ > e(2t^/^/e^/^)] < 1/4. By Chernoff again, 
the median will be less than with probability at least 1 — 1/n^, and v is light. A union 

bound over all v completes the argument. □ 

Obviously, there is an upper bound on the number of vertices with either high degree or many 
triangles. 

Corollary 4. The number of heavy vertices is at most 3{st)^^^. 

The following is where the degeneracy bounds come into play. We give a direct, self-contained 
proof, but note the connection to Chiba-Nishizeki’s bound. 

Lemma 5. The expected runtime o/heavy(u) is /t). 
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Proof. It suffices to argue that the expected time to generate a single sample of Yj is 0(1). Our 
query model allows for selecting a u.a.r. edge in 5{v) by a single query. If dy < y/m, then the degree 
of smaller endpoint for any e € 5{v) is at most y/rri. So a sample is clearly generated in 0(1) time. 
Suppose dy > y/m. If edge e = {v,w) is sampled, the runtime is 0(1 + m.m{dy,dy,)/^/m). Hence, 
the expected runtime to generate Yj is, up to constant factors, at most: 


d^^ ^ (1 + m m.m{dy,dyy)) = 

ui€r(u) 


< 

< 


1 + {\/rndy) ^ ^ dyy + {y/rndy) ^ ^ dy 

iHEr('u) 

dw ^dy dyj >d 

V 

1 + m~^ + m ^^‘^\{w\dyy > dy}\ 

W 

3 + m~^^^{2mfdy) = 0(1) 


□ 

We will assume that the random coins used by heavy are fixed in advance, and that heavy/light 
vertices satisfy the condition of Lemma 3. This allows us to treat the heavy/light labeling as 
deterministic. By a union bound, it only adds 1/n to the errors in all subsequent probability 
bounds. 


3.1 The utility of heavy vertices 

Let L and H denote the set of light and heavy vertices. For every triangle A, we associate a weight 
depending on the number of light vertices in A. We set wt(A) = 0 if A has no light vertices, and 
wt(A) = l/2£, if A has i ^ 0 light vertices. 

Lemma 6. ^ 

Proof. Define indicator x(e. A) for triangle A containing edge e. Consider a triangle A that contains 
^ / 0 light vertices. Then Edgl Ylee5{v) x(C) is exactly 2i, which is l/wt(A). li £ = wt(A) = 0, 
this expression is 0. By interchanging summations, 

Y wt(Te) = ^wt(A)^ Y x(e,A) = t-|{A|wt(A) = 0}|. 

vGL eGS{v) A vGL e^S{v) 


But the number of triangles with weight 0 is at most which by Corollary 4 is at most det. □ 

We define wt(T) = EAeT^i'(^) ^ oi' triangles. Abusing notation, define wt(u) = 

EeG<5(i;) '''^t(Te) for u € L, and wt(u) = 0 for u € LI. 

Theorem 7. Let s > {clog{n/e)/e^)n/t^^^. Sample s u.a.r. vertices vi,V 2 ,... ,Vs. Then'E['^-^^ wt{vi)/s] E 
[f(l — 9s)/n, t/n] and Pr[Ej<s wt{vi)/s < f(l — 10e)/n] < e^/n. 

Proof. The expectation holds by Lemma 6. Let Y denote the random variable Yli<s wt(ui)/s. Note 
that wt(u) < ty, which by Lemma3 is at most A multiplicative Chernoff bound tells us 

that Pr[y < E[y](l — e)] < exp(—EsE[y]/3(2t^/^/e^/^)). Algebra on the exponent: 


e^sE[y] ^ clog(n/e)(n/et^/^) • t/(2n) 
6^2/3/ei/3 - 6t2/3/el/3 


D(log(n/e)) 


□ 
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4 The full algorithm 


We come to the main algorithm. It is convenient to dehne a procedure estimate uses the values of 
m and t, and only has a lower tail bound. Later, we employ Markov and geometric search to get 
the bonafide algorithm that estimates t. 

estimate 

1. Sample si = cie~^log{n/e)nt~^^^ u.a.r. vertices. Call this multiset S. 

2. Set up a data structure to sample vertices in S proportional to degree. 

3. Repeat for i = 1, 2,..., S 2 = C 2 e“‘^(log^ times: 

(a) Sample v G S proportional to dy and sample u.a.r. e € 6{v). Let u be lower 
degree endpoint. 

(b) If du < y/m, set i? = 1 with probability dy/y/m and set R = 0 otherwise. If 
du > y/m, set R = \du/^/m]. 

(c) Repeat for j = 1,2,R: 

i. Pick u.a.r. neighbor w of u. 

ii. If e with w does not form triangle, set Zj = 0. 

hi. If e with w forms triangle A: call heavy for all vertices in A. If u is heavy, 
set Zj = 0. Otherwise, set Zj = max{du, ■yrn)wt(A). 

(d) Set Yi = Zj/R. (If R = 0, set Yi = 0.) 

4. Output A = n{Y,y^sdv)h2i^i)/siS2 

Theorem 8. E[A] S [t(I — 2e),t] and Pr[A < t{l — 20e)] < 3e/logn. 

Proof. There are three “levels” of randomness. First is the choice of S, the second level is the 
choice of e (Step 3a), and finally the ZjS. For analyzing the randomness in any level, we condition 
on the previous levels. It is helpful to break the proof up using claims. 

Claim 9. Condition of some S being chosen, and let ds = nyi\s] = d-s^Y.v^s^Kv) 

and var(yjl5) < y/rnEi\Yi\S]. 

Proof. This is similar to the argument in Lemma 2. Let vertex Vi and edge e, with lower degree 
endpoint Ui be chosen in iteration i. We simply refer to this event by £i, so the conditioning is over 
S and Si. (We use Ijr is the indicator for event F.) 

If dui < y/m, E[y|5,£^i] = {dui/y/rn)d~lly^(zLY,^(iT, a/w • wt(A) = l^.ei;,wt(rej. Since the 

i e.£ 

maximum value of Y in this case is at most y/m, var(y|5,£’*) < ^/rrlEI\Y\S,£i]. 

We prove the same bound if > y/m. So 'Ei[Zj\S,£i] = (i„;Wt(A) = 

1-vi&L^t{Te/j . As before, Yai{Zj\S,Si) < dufYj[Zj\S,Si]. By linearity of expectation E[y|S',£^j] = 
l^.gLwt(reJ. By independence of the {Zj\S,Si) variables, var(yj|S',< y/mEi\Yi\S,Si]. We re¬ 
move the conditioning on Si'. 

E[yi|5] = X] ^ ^ ^ ^t{Te/j^y^^L = ^ wt(u) 

vdS ^ ^ eGS(v) v£S eGS(v) v£S 

We also have var(yj|5) < y/mE\Yi\S]. □ 

Hence, E[A|S'] = nd5E[yi|5]/si = wt(u)/|S'|. By Theorem?, the expectation over S 

(which yields E[A]) is in [t(I — 9e),t]. 

We will call S good if wt(u)/si > t(I — 10e)/n. By Theorem?, this happens with 

probability at least 1 — e^/n. We call S great if, in addition to being good, ds = — 
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si{2m/n){\ogn)/£. The expected value, over S', of dg is si(2m/n). By the Markov bound and the 
union bound, the probability that S is great is at least 1 — ^ejXogn. 

We apply Chebyshev’s inequality onY = Yijs^-, conditioned over S. We get that Pr[|y|S — 

E[y|S]| > eE[y|S]] is at most 

var(y|S) ^ y/mEi\Y\S] 

e2E[y|S]2 - e2(c2e-4 log^ n)m3/2t-iE[y|S]2 “ C2(log2 n)(m/t)E[y|S] 

Note that E[y|S] = Sijgs wt(u), which for great S is at least (t/4n)(logn/e). Hence, Pr[|y|S — 
E[y|S]| > eE[y|S]] < e/logn. Conditioned on S, X is just a scaling of Y. So we get Pr[|X|S — 
E[X|S]| > £E[y|S]] < e/logn. Note that E[y|S] = n'Y^^^gWt{v)/si, which for great S is at 
least t(l — lOe). Hence, for great S, Pr[X|S < t(l — 20e)/n] < e/logn. The probability of S 
not being great is at most 2e/logn. We apply the union bound to remove the conditioning, so 
Pr[y < t(l — 20e)/n] < 3e/log n. □ 

Theorem 10. The expected running time o/estimate is 0*{n/t^/^ -\- jt). 

Proof. The sampling of S is done in 0*{n/t^^^) time. Let us compute the expected number of 
triangles found. In iteration i, we are basically picking a u.a.r. edge of G. Conditioned on choosing 
edge e, the expected number of triangles found is at most 2{du/y/rn){te/du) = 2tely/m. Averaging 
over edge e, the expected number of triangles found in a single iteration is at most 2tjm^^‘^. There 
are /t) iterations, leading to grand total of 0*(1) expected triangles. Thus, there are 0*(1) 

expected calls to heavy, each taking /t) time by Lemma 5. 

The time required to generate the ZjS is also /t) by an argument identical to that in 

the proof of Lemma 5. □ 

Theorem 11. There exists a 0*{n/t^/^ + rr?l‘^ jP) algorithm that provides an estimate for the 
triangle count in [(1 — e)t, (1 + e)t] with probability at least > 5/6. 

Proof First, we need to estimate m. This can be done with suitably high probability by the 
algorithm of Goldreich-Ron [GR08] in 0*{nfy/m) time. (We give an independent proof in the next 
section of Theorem 13, which gives the desired algorithm.) Since t < this can be absorbed in 

the 0*{n/t^/^) bound. 

We perform a geometric search for t, by guessing its value as n^, n^/2, n^/2^,.... We run the 
procedure of Theorems independently ce“^loglogn times, and take the minimum estimate. By 
Markov Pr[A < (1 + e)t] > e/2. With probability at least 1 — 1/log^n, the minimum estimate is 
at most (1 + e)t. 

If the estimate is larger than the current guess of t, we halve the guess for t. Eventually, we 
reach an appropriate guess where the estimates of Theorem 8 to kick in. At the stage, the minimum 
of ce~^ loglogn estimates is at most (l + e)t and at least (1 —e)t with probability at least 1 —1/logn 
(we rescale e from Theorems). A union bound over all errors completes the proof. □ 

5 A (1 + £)-approximation for the average degree 

We impose a total order -< on the vertices, where u ^ v du < dy or, dy = dy and the ID of u is 
less than that of v. (The latter is just an arbitrary but consistent tie-breaking rule). We use df to 
be denote the number of neighbors of v “higher” than v according to 

Define random variable X as follows. Pick u.a.r. vertex v, then pick u.a.r. vertex rr £ r(u). If 
V < u, X = 2dy, else A = 0. We use d to denote 2m/n, the average degree. 
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Theorem 12. Suppose s > [clog(e ^logn)e ‘^\nl^era. Let Xi,...,Xs be i.i.d. as X (defined 
above), and X = Y)i ^i/s- Then E[X] = d and Pr[X < (1 — £)d] < 1/e^ log^ re. 

Proof. The expectation bound is direct. E[X] = n~^ /dv) • 2d„ = 2^^(i+/re = d. Let 

vi,V 2 , ■ ■ ■ ,Vs be the u.a.r. vertices corresponding to Xi,...,Xg. Set k = y/em, so let Sk denote the 
highest k vertices, according to First, some properties of Sk- For all v G Sk, dfi < k. On the 
other hand, for all v ^ Sk, dy < 2m/k. (If not, the total sum of degrees in Sk exceeds 2m.) 

Define random variable as: Yi = Xj if Vi ^ Sk and 0 otherwise. Denoting Y = 
note that X > T, so it suffices to provide a lower tail for Y. Observe that E[y] = 2re“^ Ylv^Sk 

= 2n~^{m — YlveSk ^ 2n~^{m — k"^) > (1 — £)d. 

We apply a standard multiplicative Chernoff bound on T = Yi/s, noting that Yi G [0, 2m/k]. 
So Pr[y > (1 — e)E[y]] < exp(—e^sE[y]/3(2m/A:)). and going through the motions, 

e^sE[y]/3(2rre//c) > c(l — e) log(e“^ logre)dre/6m = D(log(e“^ log re)) 

By a sufficiently large choice of c, Pr[y > (1 — e)^d] < 1/e^log^re. Rescale e to complete the 
proof. □ 

To get an upper tail bound, we simply use Markov on X. Of course, we do not have a bonafide 
algorithm, since the value of m is required to choose s. But a simple geometric search for m wraps 
up the whole proof. This is a repeat of the proof of Theorem 11. 

Theorem 13. There exists a log{£~^ log n)n/y/m) algorithm that provides an estimate in 

[(1 — £)d, (1 + £)d] with probability > 5/6. 

Proof. We perform a geometric search for m, by guessing its value as re^, re^/2, re^/2^,.... Consider 
the estimate of Theorem 12 (for some guessed value of rre). By Markov, Pr[X < (1 + e)d] > e/2. 
Basically, we run the procedure of Theorem 12 independently ce“^loglogre times and take the 
minimum estimate. With probability at least 1 — 1/log^re, the estimate is at most (1 + £)d. 

This yields an estimate for m as well. If the estimate is larger than the current guess, we halve 
the guess for rre. Eventually, we reach a guess such that s is large enough (in Theorem 12). At 
the stage, the minimum of ce~^ log log re estimates is at most (1 + e)d and at least (1 — £)d with 
probability at least 1 — 1/log re. A union bound over all errors completes the proof. □ 

We note that this proof is very similar to Feige’s (2 + £)-approximation. His proof is more 
involved and uses tighter arguments to get the dependence on e down to e~^. 
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